Asymptotic and Finite-Time Guarantees for Langevin-Based Temperature Annealing in InfoNCE

Chaudhry, Faris

arXiv.org Machine Learning

The InfoNCE loss in contrastive learning depends critically on a temperature parameter, yet its dynamics under fixed versus annealed schedules remain poorly understood. We provide a theoretical analysis by modeling embedding evolution under Langevin dynamics on a compact Riemannian manifold. Under mild smoothness and energy-barrier assumptions, we show that classical simulated annealing guarantees extend to this setting: slow logarithmic inverse-temperature schedules ensure convergence in probability to a set of globally optimal representations, while faster schedules risk becoming trapped in suboptimal minima. Our results establish a link between contrastive learning and simulated annealing, providing a principled basis for understanding and tuning temperature schedules.
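As a concrete illustration (ours, not the paper's implementation), the sketch below computes a per-anchor InfoNCE loss over unit-normalized embeddings together with a logarithmic inverse-temperature schedule beta_t = beta_0 + c*log(1 + t); the names and constants are illustrative, and the convergence guarantee additionally requires the rate constant c to dominate the relevant energy barriers.

```python
import numpy as np

def info_nce_loss(z_anchor, z_positive, z_negatives, temperature):
    """Per-anchor InfoNCE loss: -log softmax of the positive similarity.

    z_anchor, z_positive: (d,) embeddings; z_negatives: (k, d).
    Embeddings are assumed L2-normalized, i.e. points on the unit
    sphere (a compact Riemannian manifold, as in the paper's setting).
    """
    pos = z_anchor @ z_positive / temperature      # scalar logit
    neg = z_negatives @ z_anchor / temperature     # (k,) logits
    logits = np.concatenate(([pos], neg))
    m = logits.max()                               # stabilized log-sum-exp
    return -pos + m + np.log(np.exp(logits - m).sum())

def log_temperature_schedule(step, beta0=1.0, c=1.0):
    """Slow annealing: inverse temperature beta_t = beta0 + c*log(1 + t),
    so the temperature tau_t = 1/beta_t decays logarithmically -- the
    schedule class covered by the classical annealing guarantee."""
    return 1.0 / (beta0 + c * np.log1p(step))
```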




Discovering Causal Relationships using Proxy Variables under Unmeasured Confounding

Wu, Yong, Fu, Yanwei, Wang, Shouyan, Wang, Yizhou, Sun, Xinwei

arXiv.org Machine Learning

Inferring causal relationships between variable pairs in observational studies is crucial but challenging due to the presence of unmeasured confounding. While previous methods employed negative controls to adjust for the confounding bias, they were either restricted to the discrete setting (i.e., all variables are discrete) or relied on strong assumptions for identification. To address these problems, we develop a general nonparametric approach that accommodates both discrete and continuous settings for testing causal hypotheses under unmeasured confounders. Using only a single negative control outcome (NCO), we establish a new identification result based on a newly proposed integral equation linking the outcome and the NCO, requiring only completeness and mild regularity conditions. We then propose a kernel-based testing procedure that is more efficient than existing moment-restriction methods, and we derive the asymptotic level and power properties of our tests. Furthermore, we examine cases where the NCO-only procedure fails to achieve identification, and introduce a new procedure that incorporates a negative control exposure (NCE) to restore identifiability. We demonstrate the effectiveness of our approach through extensive simulations and real-world data from the Intensive Care Data and World Values Survey.
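For intuition only (this is our generic sketch, not the authors' procedure), a kernel moment-restriction test of the kind the abstract compares against can be written as a degenerate V-statistic: residuals that should have zero conditional mean under the null hypothesis are aggregated through an RBF kernel on the conditioning variable.

```python
import numpy as np

def rbf_kernel(Z, bandwidth=1.0):
    """Gaussian kernel matrix K[i, j] = exp(-||z_i - z_j||^2 / (2*h^2))."""
    sq = np.sum(Z ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * Z @ Z.T
    return np.exp(-d2 / (2.0 * bandwidth ** 2))

def kernel_moment_statistic(residuals, Z, bandwidth=1.0):
    """V-statistic T = (1/n^2) * sum_{i,j} r_i * r_j * k(z_i, z_j).

    Under the null E[r | Z] = 0 and with a characteristic kernel, T
    concentrates near zero; large values reject the moment restriction.
    Calibration of the null distribution (e.g., via a wild bootstrap)
    is omitted in this sketch.
    """
    r = np.asarray(residuals, dtype=float)
    Z = np.asarray(Z, dtype=float)
    if Z.ndim == 1:
        Z = Z[:, None]
    K = rbf_kernel(Z, bandwidth)
    return float(r @ K @ r) / len(r) ** 2
```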


The sequence of distributions that converges weakly to π

Neural Information Processing Systems

We are very grateful to all the reviewers for their thoughtful feedback. All typos and minor points will also be fixed. Prop. 3 implies that any inference problem can be decomposed into a sequence of [...]. Another consideration, as highlighted by the example of 4.3, is that reducing the Bayesian computation [...], as the two methods have different computational cost patterns. This is required for each optimization step as well. Currently, however, we haven't found problems where the basis derived from H [...]. In the discussion after Prop. 1, we should have [...]. The phrase "lack of precision" in 4.4 refers to the finite number of samples drawn from [...].


Asymptotic behavior of eigenvalues of large rank perturbations of large random matrices

Afanasiev, Ievgenii, Berlyand, Leonid, Kiyashko, Mariia

arXiv.org Artificial Intelligence

Random Matrix Theory (RMT) is a classical theory that has been developing for more than 70 years. Initially, RMT arose from problems in nuclear physics, and it has since found applications in mathematics, physics, finance, and many other disciplines. Recently, new problems have been arising from the area of Machine Learning. Indeed, the weight matrices of Deep Neural Networks (DNNs) are often initialized randomly. Moreover, modern DNNs have large weight matrices, which is why their spectral properties can be described by the asymptotic behavior of N × N random matrices as N goes to infinity.
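A minimal numerical illustration of this asymptotic picture (our addition, not from the paper): for a large symmetric matrix with independent entries of variance 1/N, the empirical spectrum concentrates on the interval [-2, 2], the support of the Wigner semicircle law.

```python
import numpy as np

# Eigenvalues of a large symmetric random matrix with entry variance 1/N:
# by the Wigner semicircle law, the spectrum fills [-2, 2] as N -> infinity.
N = 2000
A = np.random.default_rng(0).normal(size=(N, N)) / np.sqrt(N)
W = (A + A.T) / np.sqrt(2.0)     # symmetrize; off-diagonal variance stays 1/N
eigs = np.linalg.eigvalsh(W)
print(eigs.min(), eigs.max())    # close to -2 and 2 for large N
```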



Statistical and Topological Properties of Sliced Probability Divergences

Neural Information Processing Systems

We can now prove Theorem 1. Proof of Theorem 1. Now, let us prove the other implication, i.e., Theorem 2. Our result is thus consistent with the existing results in the literature. Next, we show that this result holds for two popular choices of kernels. We conclude that the kernel k̂ is positive definite, hence (S17) holds for RBF kernels. S1.4 Proof of Theorem 3. We start by upper bounding the distance between two regularized measures. The desired result is obtained as a direct application of Theorems 2 and 3. S1.6
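For context (our sketch, not part of the excerpt), a sliced divergence averages a one-dimensional divergence over random projection directions; the standard Monte Carlo estimator of the sliced p-Wasserstein distance between equal-size samples looks as follows.

```python
import numpy as np

def sliced_wasserstein(X, Y, n_projections=100, p=2, seed=None):
    """Monte Carlo estimate of the sliced p-Wasserstein distance between
    empirical measures on R^d (X and Y must hold equally many samples):
    project both samples onto random unit directions, apply the sorted
    one-dimensional W_p formula on each slice, and average.
    """
    rng = np.random.default_rng(seed)
    thetas = rng.normal(size=(n_projections, X.shape[1]))
    thetas /= np.linalg.norm(thetas, axis=1, keepdims=True)
    total = 0.0
    for theta in thetas:
        x, y = np.sort(X @ theta), np.sort(Y @ theta)
        total += np.mean(np.abs(x - y) ** p)   # 1-D W_p^p of this slice
    return (total / n_projections) ** (1.0 / p)
```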



Control, Optimal Transport and Neural Differential Equations in Supervised Learning

Phung, Minh-Nhat, Tran, Minh-Binh

arXiv.org Artificial Intelligence

From the perspective of control theory, neural differential equations (neural ODEs) have become an important tool for supervised learning. In the fundamental work of Ruiz-Balet and Zuazua (SIAM Review, 2023), the authors pose an open problem regarding the connection between control theory, optimal transport theory, and neural differential equations. More precisely, they ask how one can quantify the closeness of the optimal flows in neural transport equations to the true dynamic optimal transport. In this work, we propose a construction of neural differential equations that converge to the true dynamic optimal transport in the limit, providing a significant step toward solving the aforementioned open problem.
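To fix ideas (a minimal sketch in our own notation, not the paper's construction), a neural ODE transports inputs through a flow such as dx/dt = tanh(W(t)x + b(t)); with piecewise-constant controls and explicit Euler steps this reads:

```python
import numpy as np

def neural_ode_flow(x0, weights, biases, dt=0.05):
    """Explicit Euler discretization of dx/dt = tanh(W(t) x + b(t)).

    weights, biases: piecewise-constant controls, one (d, d) matrix and
    one (d,) vector per time step. The controls play the role of the
    velocity field transporting the input measure, which is what the
    comparison with dynamic optimal transport is about.
    """
    x = np.asarray(x0, dtype=float)
    for W, b in zip(weights, biases):
        x = x + dt * np.tanh(W @ x + b)
    return x
```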